Resumen:
In this work, we applied a stochastic simulation methodology to quantify the power of the detection of outlying mixture components of a stochastic model, when applying a reduced-dimension clustering technique such as Self-Organizing Maps (SOMs). The essential feature of SOMs, besides dimensional reduction into a discrete map, is the conservation of topology. In SOMs, two forms of learning are applied: competitive, by sequential allocation of sample observations to a winning node in the map, and cooperative, by the update of the weights of the winning node and its neighbors. By means of cooperative learning, the conservation of topology from the original data space to the reduced (typically 2D) map is achieved. Here, we compared the performance of one- and two-layer SOMs in the outlier representation task. The same stratified sampling was applied for both the one-layer and two-layer SOMs; although, stratification would only be relevant for the two-layer setting—to estimate the outlying mixture component detection power. Two distance measures between points in the map were defined to quantify the conservation of topology. The results of the experiment showed that the two-layer setting was more efficient in outlier detection while maintaining the basic properties of the SOM, which included adequately representing distances from the outlier component to the remaining ones.
Palabras Clave: self-organizing map; neural networks; robustness; nonlinear projections; dimensionality reduction; deep semisupervised learning
Índice de impacto JCR y cuartil WoS: 2,838 - Q2 (2021); 2,500 - Q1 (2023)
Referencia DOI: https://doi.org/10.3390/app11146241
Publicado en papel: Julio 2021.
Publicado on-line: Julio 2021.
Cita:
G. Valverde, J.M. Mira McWilliams, B. González-Pérez, One-layer vs. two-layer som in the context of outlier identification: a simulation study. Applied Sciences. Vol. 11, nº. 14, pp. 6241-1 - 6241-22, Julio 2021. [Online: Julio 2021]